Vandermonde's identity

For the expression for a special determinant, see Vandermonde matrix.

In combinatorics, Vandermonde's identity, or Vandermonde's convolution, named after Alexandre-Théophile Vandermonde (1772), states that

${m%2Bn \choose r}=\sum_{k=0}^r{m \choose k}{n \choose r-k},\qquad m,n,r\in\mathbb{N}_0,$

for binomial coefficients. This identity was given already in 1303 by the Chinese mathematician Zhu Shijie (Chu Shi-Chieh). See Askey 1975, pp. 59–60 for the history.

There is a q-analog to this theorem called the q-Vandermonde identity.

1 Algebraic proof
2 Combinatorial proof
3 Generalized Vandermonde's identity
4 The hypergeometric probability distribution
5 Chu–Vandermonde identity
6 References

Algebraic proof

In general, the product of two polynomials with degrees m and n, respectively, is given by

$\biggl(\sum_{i=0}^m a_ix^i\biggr) \biggl(\sum_{j=0}^n b_jx^j\biggr) = \sum_{r=0}^{m%2Bn}\biggl(\sum_{k=0}^r a_k b_{r-k}\biggr) x^r,$

where we use the convention that a_i = 0 for all integers i > m and b_j = 0 for all integers j > n. By the binomial theorem,

$(1%2Bx)^{m%2Bn} = \sum_{r=0}^{m%2Bn} {m%2Bn \choose r}x^r.$

Using the binomial theorem also for the exponents m and n, and then the above formula for the product of polynomials, we obtain

$\begin{align} \sum_{r=0}^{m%2Bn} {m%2Bn \choose r}x^r &= (1%2Bx)^{m%2Bn}\\ &= (1%2Bx)^m (1%2Bx)^n \\ &= \biggl(\sum_{i=0}^m {m\choose i}x^i\biggr) \biggl(\sum_{j=0}^n {n\choose j}x^j\biggr)\\ &=\sum_{r=0}^{m%2Bn}\biggl(\sum_{k=0}^r {m\choose k} {n\choose r-k}\biggr) x^r, \end{align}$

where the above convention for the coefficients of the polynomials agrees with the definition of the binomial coefficients, because both give zero for all i > m and j > n, respectively.

By comparing coefficients of x^r, Vandermonde's identity follows for all integers r with 0 ≤ r ≤ m + n. For larger integers r, both sides of Vandermonde's identity are zero due to the definition of binomial coefficients.

Combinatorial proof

Vandermonde's identity also admits a more combinatorics-flavored double counting proof, as follows. Suppose a committee in the US Senate consists of m Democrats and n Republicans. In how many ways can a subcommittee of r members be formed? The answer is of course

${m%2Bn \choose r}.$

But on the other hand, the answer is the sum over all possible values of k, of the number of subcommittees consisting of k Democrats and r − k Republicans.

Generalized Vandermonde's identity

If in the algebraic derivation above more than two polynomials are used, it results in the generalized Vandermonde's identity. For y + 1 polynomials:

$\sum_{k_1,\dots,k_y = 0}^x {n\choose k_1} {n\choose k_2} {n\choose k_3} \cdots {n \choose x - \sum_{j = 1}^y k_j } = { \left( y %2B 1 \right) n \choose x}.$

The hypergeometric probability distribution

When both sides have been divided by the expression on the left, so that the sum is 1, then the terms of the sum may be interpreted as probabilities. The resulting probability distribution is the hypergeometric distribution. That is the probability distribution of the number of red marbles in r draws without replacement from an urn containing n red and m blue marbles.

Chu–Vandermonde identity

The identity generalizes to non-integer arguments. In this case, it is known as the Chu–Vandermonde identity (see Askey 1975, pp. 59–60) and takes the form

${s%2Bt \choose n}=\sum_{k=0}^n {s \choose k}{t \choose n-k}$

for general complex-valued s and t and any non-negative integer n. This identity may be re-written in terms of the falling Pochhammer symbols as

$(s%2Bt)_n = \sum_{k=0}^n {n \choose k} (s)_k (t)_{n-k}$

in which form it is clearly recognizable as an umbral variant of the binomial theorem. (For more on umbral variants of the binomial theorem, see binomial type) The Chu–Vandermonde identity can also be seen to be a special case of Gauss's hypergeometric theorem, which states that

$\;_2F_1(a,b;c;1) = \frac{\Gamma(c)\Gamma(c-a-b)}{\Gamma(c-a)\Gamma(c-b)}$

where $\;_2F_1$ is the hypergeometric function and $\Gamma(n%2B1)=n!$ is the gamma function. One regains the Chu–Vandermonde identity by taking a = −n and applying the identity

${n\choose k} = (-1)^k {k-n-1 \choose k}$

liberally.

References

Askey, Richard (1975), Orthogonal polynomials and special functions, Regional Conference Series in Applied Mathematics, 21, Philadelphia, PA: SIAM, pp. viii+110